Praznik: High performance information-based feature selection
Abstract
Information filters are an important class of feature selection methods. They bring together a strong theoretical background, execution speed and high quality. The paper provides a brief review of said methods and introduces praznik, an R package collecting their efficient implementations. Its functionality is discussed with illustrative example analyses on established benchmark datasets, which also demonstrate the package’s competitive accuracy and very high computational efficiency in comparison to other tools. Moreover, praznik exposes its low-level functionality and thus aims to support the research and development of novel information theory based methods.
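As a rough illustration of what an information filter does, the sketch below ranks discrete features by their estimated mutual information with the decision and keeps the top k. It is a minimal pure-Python example written for this summary, not praznik's implementation or API; the function names `mutual_information` and `rank_features` and the toy data are assumptions made for illustration only.

```python
from collections import Counter
from math import log


def mutual_information(xs, ys):
    """Plug-in estimate of I(X;Y) in nats for two equal-length discrete sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))  # counts of each observed (x, y) pair
    px = Counter(xs)              # marginal counts of x
    py = Counter(ys)              # marginal counts of y
    # Sum p(x,y) * log(p(x,y) / (p(x) * p(y))) over observed pairs only.
    return sum(
        (c / n) * log(c * n / (px[x] * py[y]))
        for (x, y), c in joint.items()
    )


def rank_features(features, target, k):
    """Return the names of the k features with the highest estimated MI with target."""
    scores = {name: mutual_information(col, target) for name, col in features.items()}
    return sorted(scores, key=scores.get, reverse=True)[:k]


# Hypothetical toy data: one informative, one noisy and one useless feature.
target = [0, 0, 1, 1, 0, 1, 0, 1]
features = {
    "copy": [0, 0, 1, 1, 0, 1, 0, 1],      # identical to the target
    "noisy": [0, 0, 1, 1, 0, 1, 1, 0],     # agrees with the target in 6 of 8 rows
    "constant": [0, 0, 0, 0, 0, 0, 0, 0],  # carries no information
}

top = rank_features(features, target, 2)
print(top)
```

Scoring each feature independently is the simplest possible criterion; it ignores redundancy between the selected features, which more elaborate information filters try to account for.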
Similar resources
CBFS: High Performance Feature Selection Algorithm Based on Feature Clearness
BACKGROUND The goal of feature selection is to select useful features and simultaneously exclude garbage features from a given dataset for classification purposes. This is expected to bring reduction of processing time and improvement of classification accuracy. METHODOLOGY In this study, we devised a new feature selection algorithm (CBFS) based on clearness of features. Feature clearness exp...
Information-based Feature Selection
Feature selection is a topic of great interest in applications dealing with high-dimensional datasets. These applications include gene expression array analysis, combinatorial chemistry and text processing of online documents. Using feature selection brings about several advantages. First, it leads to lower computational cost and time. Less memory is needed to store the data and less processing...
Can high-order dependencies improve mutual information based feature selection?
Mutual information (MI) based approaches are a popular paradigm for feature selection. Most previous methods have made use of low-dimensional MI quantities that are only effective at detecting low-order dependencies between variables. Several works have considered the use of higher dimensional mutual information, but the theoretical underpinning of these approaches is not yet comprehensive. To ...
Feature Selection based on Information Gain
The attribute reduction is one of the key processes for knowledge acquisition. Some data set is multidimensional and larger in size. If that data set is used for classification it may end with wrong results and it may also occupy more resources especially in terms of time. Most of the features present are redundant and inconsistent and affect the classification. In order to improve the efficien...
Infosel++: Information Based Feature Selection C++ Library
A large package of algorithms for feature ranking and selection has been developed. Infosel++, Information Based Feature Selection C++ Library, is a collection of classes and utilities based on probability estimation that can help developers of machine learning methods in rapid interfacing of feature selection algorithms, aid users in selecting an appropriate algorithm for a given task (embed f...
Journal
Journal title: SoftwareX
Year: 2021
ISSN: 2352-7110
DOI: https://doi.org/10.1016/j.softx.2021.100819